ALP: Alleviating CPU-Memory Data Movement Overheads in Memory-Centric Systems

نویسندگان

چکیده

Partitioning applications between near-data processing (NDP) and host CPU cores causes inter-segment data movement overhead, which is caused by moving generated one segment (e.g., instructions, functions) used in other consecutive segments. Prior works take two approaches to this problem. The first approach maps segments NDP or based on the properties of each segment, neglecting overhead. second partitions overall memory bandwidth savings, does not offload best-fitting core if they incur high movement. We show that 1) mapping its ideally can provide substantial benefits, 2) reduces benefit significantly. introduce ALP, a new programmer-transparent technique alleviate overhead systems. ALP proactively accurately transfers required key observation instructions generate stay same across different executions program. uses compiler pass identify these specialized hardware transfer their produced at runtime. evaluate wide range workloads demonstrate 54.3% 45.4% average speedup over CPU-only NDP-only executions, respectively.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Memory-Centric Data Storage for Mobile Systems

Current data storage on smartphones mostly inherits from desktop/server systems a flash-centric design: The memory (DRAM) effectively acts as an I/O cache for the relatively slow flash. To improve both app responsiveness and energy efficiency, this paper proposes MobiFS, a memory-centric design for smartphone data storage. This design no longer exercises cache writeback at short fixed periods o...

متن کامل

Memory Controller Memory Controller CPU

In its most general form, interconnection networks are a central component of all computing and communication systems from the internal interconnects of chip-scale embedded architectures to geographic-scale systems such as wide area networks and the internet. This section focuses on interconnection networks as they are used in multiprocessor and multicore systems. Specifically, the section addr...

متن کامل

Reducing Memory Sharing Overheads in Distributed JVMs

Distributed JVM systems by supporting Java’s shared-memory model enable concurrent Java applications to run transparently on clusters of computers. Aiming to reduce the overheads associated to memory coherence enforcement mechanisms required in such distributed JVMs, we propose two new techniques, selective dynamic diffing and lazy home allocation. To evaluate their potential benefits, both tec...

متن کامل

Directory Memory CPU Cache Network Interface Directory Memory CPU Cache Network Interface Directory Memory CPU Cache Network Interface � � � � � � � �

Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. Abstract Distributed Shared Memory (DSM) approach provides the illusion o...

متن کامل

Alleviating memory impairment through distraction.

Distraction typically has a negative impact on memory for recent events and patients with existing memory impairment are particularly vulnerable to distractor interference. In contrast, here we establish a beneficial effect for distractor presentation in humans for both patients with memory impairment due to bilateral hippocampal lesions and healthy adults with low memory performance. Recogniti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Emerging Topics in Computing

سال: 2023

ISSN: ['2168-6750', '2376-4562']

DOI: https://doi.org/10.1109/tetc.2022.3226132